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The statistical property of a growing scale-free network is studied based on an earlier model 
proposed by Krapivsky, Rodgers, and Redner [Phys. Rev. Lett. 86, 5401 (2001)], with the additional 
constraints of forbidden of self-connection and multiple links of the same direction between any two 
nodes. Scaling exponents in the range of 1-2 are obtained through Monte Carlo simulations and 
various clustering coefficients are calculated, one of which, C out , is of order 10 _1 , indicating the 
network resembles a small-world. The out-degree distribution has an exponential cut-off for large 
out-degree. 
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I. INTRODUCTION 

To study the statistical property of a complex system 
composed of many interacting individual components, it 
is often helpful to map the system into a network of nodes 
and links (edges). Each node in this network represents 
one component of the real system and the interaction, 
if there is any, between two components is denoted by 
an edge, either directed or undirected, between the two 
corresponding nodes in the network. One quantity of 
interest is the node-degree profile of the formed network: 
How does n(k), the total number of nodes with a given 
number k of links attached (the node degree), scales with 
fc? Empirical observations revealed that many social and 
biological networks have the scale-free property 0, 
that is, 

n(k) ~ k~ v (1) 

as k becomes large enough. The scaling exponent v is 
typically in the range of 2 < v < 3; but there are evi- 
dences that some networks have scaling exponents in the 
range of 1-2 while a few other networks have scaling 
exponents larger than 3 (for a collection of experimental 
data, please refer to Table II of 0). 

To explain the scale-free characteristic, one appealing 
mechanism is to assume that the network (1) keeps grow- 
ing and, (2) during this growth process, new edges are 
generated and are attached preferentially to those nodes 
which have already been attached by a large number of 
edges 0. Based on this mechanism several models have 
been suggested 0) IS > but they predicted the scal- 
ing exponent v to be greater than 2, thus failed to ex- 
plain the behavior of those networks with smaller v. In 
Ref. this "preferential attachment" mechanism was 
questioned partly because of this apparent discrepancy 
between theory and empirical data. In Ref. it was 
shown that if one assumes a network is growing acceler- 
ately, it is possible to generate scaling exponent in the 
range between 3/2 and 2. However, it is still not clear 
whether or not this condition is absolutely necessary to 
explain experimental observations. 

An emerging property of almost all the so-far stud- 
ied scale-free networks is that they can at the same time 



be classified as small- world networks @- That is, (i) 
the diameter of the network scales as ln(7V), where N 
is the total number of nodes in the system, and (ii) the 
clustering coefficient C is independent of N and is thus 
much greater than that of a random network (~ (k) [N, 
where (k) denotes the average node-degree of the net- 
work). On the theoretical side, it was confirmed that 
growing networks generated by the mechanism of prefer- 
ential attachment will typically have diameters scales as 
In (AT) (see, for example, Ref. |l(|). However, the original 
Barabasi- Albert model [l|, |2| predicted a very small clus- 
tering coefficient C ~ N~°- 75 . The clustering coefficients 
for other models 0. were not reported. 

To improve our understanding on scale-free networks, 
in this work two questions are addressed: Will it be possi- 
ble to generate a scale- free network with scaling exponent 
v < 2 based on the preferential attachment mechanism? 
Will it be possible for a scale-free network generated this 
way to have relatively constant clustering coefficients? 
We answer these questions confirmatively by studying a 
revised Krapivsky- Rodgers-Redner model |4| with Monte 
Carlo simulation approach. 



II. THE GROWING NETWORK MODEL 

The Krapivsky-Rodgers-Redner model |4] is a gener- 
alized version of the original Barabasi- Albert model on 
scale- free networks 0. It has the following key elements: 
(i) edges are directed; (ii) new nodes are added into the 
network and are attached preferentially to existing nodes 
with larger in-degrees; (iii) creation of edges between 
"old" nodes are possible and a newly created edge also 
prefer to attach to nodes with larger degrees. This model 
is general in the sense that it takes into account direc- 
tional interactions of the real network systems, and that 
the growth of the network is not solely caused by the in- 
clusion of new nodes but also as a result of the increased 
interactions among the existing nodes of the system. It 
can be corresponded to the real systems including the 
World-Wide Web (WWW), the Internet, the food web, 
the transportation network, the e-mail network, etc.. Be- 
cause of its generality, we re-examine this model in the 
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FIG. 1: Self-connection (a) and Multiconnection of the same 
direction (b) are forbidden in the present simulation. 



present work to illustrate the property of growing scale- 
free network. 

We noticed in the original Krapivsky-Rodgers-Redner 
model the growth process permits the following two 
possibilities illustrated in Fig. ^ (1) a directed edge 
can originate from and end into the same node (self- 
connection) and (2) there can be more than one edges 
of the same direction between two nodes (multiconnec- 
tion). Allowing these two possibilities makes analytical 
calculations possible and these authors found that in the 
growing network, both the in-degree and the out-degree 
distribution follow the power-law [4j: 



n- m (k) ~ k 



2 < v XXL = 2 + pX < oo, 



(2) 



n ont (k)~k 2 < i/ out = l+l/(l-p)+np/(l-p) < oo, 

(p, A and /i are the three adjustable model parameters). 
Although the permission of self-connections and multi- 
connections might be reasonable for some networks (such 
as WWW pages, in which a page can have several links 
to another page and it can have a link bring the reader 
from one portion to another portion of the same page), it 
may fail for other kinds of networks (in a co-authorship 
network ^l|, it is meaningless for a author to be the co- 
author of himself/herself; and in a food web [lj, there 
is at most one edge of the same direction between two 
species). Because of the preferential attachment mecha- 
nism, if a node already has a large number of incoming 
and outgoing edges, it has a good possibility to form self- 
connections and by doing so its dominance is further am- 
plified; allowing multiconnection has similar effects. As a 
result, edges may concentrate on just few nodes, making 
the scaling behavior steeper. In the present work, the 
two kind of edges listed in Fig. are discounted. Be- 
cause of the reasoning outlined above, scaling exponents 
1 < v < 2 might occur in a growing network without 
self-connection and multiconnection. This point will be 
checked by Monte Carlo simulation. 



III. MONTE CARLO SET-UP 

The revised Krapivsky-Rodgers-Redner model is stud- 
ied by MC simulation. Started with a single node, at 
each step: 

(i) With probability p, a new node is created and a di- 
rected edge from it to an existing target node (3 
is setup. Of all the N existing nodes, /3 will be 
selected with probability 4] 



attach 



(/?) 



fcin(/?)+A 

E + XN 



(4) 



In the above equation, h m {f3), the in-degree, is the 
total number of incoming edges of node (3; A is a 
constant signifying the "initial attractiveness" of a 
node ; and E is the total number of edges in the 
system before this new edge is created. 

(ii) With probability q — 1 —p, a new edge pointing from 
one node a to another node (3 is created, provided 
that (1) E < N(N - 1), (2) a and (3 are not iden- 
tical, and (3) there is no any preexisting directed 
edge from a to (3. The probability that a and j3 
will be selected is governed by the probability 



Pr, 



t(a,)9) 



[fcout(a) +p][hn{f3) + A] 
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t ( 7 ) +M p in (J) + A] 



(•5) 



where &; ou t(a) denotes the out-degree of node a; fi 
is another constant with similar physical meaning 

as A; ^2 denotes the summation over all the nodes 

s 

S which is not yet approached by a directed edge 
from node 7. 

For large system size N it turns out to be quite inef- 
ficient and complicated when performing procedure (ii) 
based on a direct application of Eq. 0. This is partly be- 
cause of the fact that, after a new edge has been created 
between node a and 0, one must update the value of the 
summation in Eq. JSJ by 0{N) iterations. To speed up 
procedure (ii) , the selection of two nodes and the connec- 
tion of an edge between them is finished actually through 
the following way: 

(1) Select a outgoing node a with probability [k ou t(a) + 

fx]/(E + pN); 

(2) Select an in-coming node (3 with probability [ki n (/3) + 

X]/(E + XN); 

(3) If a and (3 are identical, or if there is already a di- 

rected edge from a to (3, repeat steps (1) and (2); 
if else, accept a and (3 and update the system. 

It is not difficult to prove that by this method the proba- 
bility for nodes a and f3 be chosen is identical to Eq. 

The algorithm code is written in CH — h language 
with some of its standard containers (including map and 
set) being exploited. 
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To estimate the scaling exponents from the simulated 
data, we use two methods. One can directly fit the data 
with Eq. n Alternatively, one can define the cumulative 
degree distribution by 



P{k) = n ( k ') ~ k- {u - x) , 



(6) 



k'>k 



and from the cumulative distribution data an estimation 
of the value of v could be obtained. 



IV. DEGREE DISTRIBUTION OF THE 
GROWING NETWORK 




In the growing network model, there are three ad- 
justable parameters, namely p, A, and \x. Figure [21 shows 
the relations between the average number of nodes and 
the in-degree and out-degree for both the original and 
the revised Krapivsky-Rodgers-Redner model. Figure [31 
shows the corresponding cumulative degree distributions 
for the two models. The network is the result of N = 10 6 
growing steps. 

At p = 0.133334 (a new node will be included on av- 
erage every 7.5 steps), A = 0.75 and \i — 3.55 the orig- 
inal model Q predicts v xn = 2.1 and ^ ut = 2.7. From 
the MC data we obtain that v m = 2.066 ± 0.014 and 
2/ ou t = 2.626 ± 0.036, in close agreement with the an- 
alytical values. At these same parameters, the revised 
model has f in = 1.925 ± 0.007 and v ou _ t = 2.269. Thus, 
excluding of self-connection and multiconnection leads to 
decreased values for the scaling exponents. Other quan- 
titative differences are: 

(1) In the revised model there is a cut-off in the in-degree 

distribution: no node has in-degree k > 9 x 10 3 . 
While in the original model, there are nodes with 
in-degree as large as k — 4 x 10 5 . 

(2) In the revised model the out-degree distribution has 

an exponential cut-off around k — 250; while in the 
original model, there are nodes with out-degree as 
large as k — 1.5 * 10 4 . 

(3) The value of n{k) is much larger in the revised model 

than in the original model for a given k (less than 
the cut-off value). This holds both for the in-degree 
distribution and for the out-degree distribution. 

These observations lead to the following picture: By 
prohibiting self-connection and multiconnection, edges 
which originally belong to just few "suppernodes" are 
now redistributed to those nodes of small or intermediate 
in- and out-degrees. Consequently, the number of nodes 
with small and intermediate node-degrees increases con- 
siderably, resulting in a smaller scaling exponent in the 
power-law decrease of the distribution and a cut-off in 
the tail of this distribution. 

In Fig. 0] we demonstrate the simulation result when 
the probability of node addition is changed to p = 0.05 
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FIG. 2: The profiles of in-degree (A) and out-degree (B) 
distribution at p = 0.133334, A = 0.75 and [i = 3.55, after a 
growing process of 10 6 steps. Square symbols are the data for 
the original Krapivsky-Rodgers-Redner model and diamonds 
are the data for the revised model. Each data point is the 
average over 20 (diamonds) or 10 (squares) realization of the 
network. The thin solid line has a slop of —2.066 in (A) and 
— 2.626 in (B). The thin dashed line has a slop of —1.925 in 
(A) and —2.269 in (B). The average number of nodes in the 
revised network is 133271, and the average number of edges 
is 999984. 



(a new node will be included on average every 20 steps) 
while the other two parameters are kept the same values 
as in Fig. [21 In these parameters, the original model 
predicts v m = 2.04 (theory) and 2.018 ±0.015 (MC) and 
v out = 2.24 (theory) and 2.190 ±0.014 (MC); while the 
revised model has an in-degree exponent v lu = 1.672 ± 
0.003 and an out-degree exponent fy out = 1.764, both of 
which are markedly smaller than 2. 

Therefore, the exclusion of self-connection and multi- 
connection can change the scaling exponent of the scale- 
free network dramatically when each node has a relatively 
large average node-degrees. It can be anticipated that 
similar behavior will be observed when the initial attrac- 
tiveness parameters, A and /i, of each node are varied. 
It is therefore possible for the present model to explain 
networks with scaling exponent v < 2. 

However, could the observation of scaling exponents 
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FIG. 3: In-degree (A) and out-degree (B) cumulative distri- 
bution for the data sets in Fig. [5] The solid lines corresponds 
to the original model, and the dashed lines to the revised 
model. From these curves, we estimate z/j n = 2.066 ± 0.014 
(the original model) and v iTL — 1.925 ± 0.007 (the revised 
model). v ou t — 2.626 ±0.036 for the original model. The out- 
degree cumulative distribution of the revised model does not 
fit well to the power-law. Therefore, the out-degree scaling 
exponent is not estimated by this method but by a direct fit 
to the distribution in Fig. [5)3, resulting in h> out = 2.269. 



1 < v < 2 be an artifact caused by finite-size effects? In 
Fig. O the calculated in-degree scaling exponent is plot- 
ted as a function of the total growth steps. This figure 
strongly indicates that even for an infinite system the 
scaling exponents will still be less than 2. 

Adamic and Hubcrman studied the WWW by map- 
ping each web domain (rather than each web-page) as a 
single node, and they reported an in-degree scaling ex- 
ponent of v m = 1.94. Mossa and co-authors, upon their 
reinterpretation of the WWW data of Barabasi et al. , 
reported an exponent of v = 1.25 0]. The e-mail net- 
work studied in Ref. has a scaling exponent in the 
range 1.47 < v < 1.82. And even smaller scaling expo- 
nents are reported in several other networks 2] . To attain 
quantitative agreement to these empirical data, one need 
nevertheless more information to help fixing the values 
of the adjustable parameters. 

A persistent property of the revised network model is 
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FIG. 4: The profiles of in-degree (A) and out-degree (B) dis- 
tribution at p = 0.05, A = 0.75 and /i = 3.55 and 10 6 growing 
steps. Squares (averaged over 10 realizations) correspond to 
the original model and diamonds (averaged over 20 realiza- 
tions) to the revised model. The thin solid line has slope 
-2.018 in (A) and -2.190 in (B). The thin dashed line has 
slope —1.672 in (A) and —1.764 in (B). The average number 
of nodes is 49831 and the average number of edges is 999860. 



that there is a rapid decay in the out-degree distribution 
(which occurs at k ou t — 400 in Fig. 0)3). Such a rapid de- 
cay was not observed in the original model. This feature 
is also absent in the in-degree distribution of both the 
revised and the original model, although the in-degree 
distribution of the revised model does have a cut-off for 
large k. Experimentally, it was reported that both the 
WWW network PUS' 

and the e-mail network show 
exponential cut-off in the node-degree distribution. 



V. CLUSTER COEFFICIENTS OF THE 
GROWING NETWORK 

As was mentioned in the introduction, many real scale- 
free network at the same time show small- world behavior 
: having small diameters and being highly clustered. 
For the original Krapivsky-Rodgers-Redner model, it has 
been reported in Ref. 10] that the average minimum path 
scales as ln(7V). We anticipate this to be hold also for the 
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actual number of edges existing between these edges. We 
define the incoming clustering coefficient as 



5^iactual(a)/[*in(«)(fcm(a) ~ 1)] 



AT' 



(8) 



where E indicates summation over all the nodes whose 

incoming edges is larger than 1, and N' is the total num- 
ber of nodes with this property. We find that C- m = 
0.0044. This value indicates that the degree of cliqueness 
of the upstream neighbors of a given node is usually very 
small. For a completely random graph, C- m — 5.63 xlO -5 . 



FIG. 5: The relation between the in-degree scaling exponent 
and growing steps for the original (squares) and the revised 
(circles) model. The parameters are set equal to that of Fig. 01 
namely, p = 0.05, A = 0.75 and /i = 3.55. The thin dotted 
line indicates the theoretical prediction of v 1VL = 2.075 for the 
original model. 



revised model. Here we focus on the clustering charac- 
teristics of the revised model system, with the parame- 
ters setting equal to those of Fig. i.e., p — 0.133334, 
A = 0.75 and // = 3.55, and a total 10 6 growing steps. 



A. Mutual-connection coefficient C mu t U ai 

Denote Gdown(ct) as the complete set of nodes which 
arc the "downstream" neighbors of node a, namely 
there exists a directed edge from a to each nodes in 
Gdown{ct)', similarly we define G up (a) as the complete 
set of nodes which are the "upstream" neighbors of node 
ct. |Gdown(a)| means the size of set Gdown(a). 

The mutual-connection coefficient is defined as 

r _ Eg |Gdown(oQn G up (oQ | /|Gdown(tt) | f 7 s 

^mutual — „ j \ > ) 

which signifies to what extent the "downstream neigh- 
bors" of one node intersect with its "upstream neigh- 
bors" . The value of G mu t U ai averaged over 20 realization 
of the growing network is 0.0010. This indicates that 
the interaction between one node and its "neighbors" in 
the network is usually not bi-directional. However, this 
value is still much larger than the value for a random 
network of the same size (N = 133271) and the same 
average degree of out-going edges {{k) = 7.50), for which 
G mutua i = 5.63 x 10- 5 . 



B. Incoming clustering coefficient Cm 

Suppose a given node a has in-degree fc; n (a). The 
maximal number of edges existing between the nodes in 
G up (a) is k- m (a)(k- m (a) - 1). Denote i a ctuai(a) as the 



C. Outgoing clustering coefficient C ou t 

The definition of the out-going clustering coefficient 
G out is similar to that of C- m . Suppose a particular node 
a has k out (a) out-going edges, and i a ctuai(a) is the total 
number of edges between the nodes in Gdown(a), then 

X^actual(a)/[fcout(a)(&out(>) - 1)] 

Gout = — jp, , (9) 

where N" is the total number of nodes whose out-degree 
is larger than 1. We find G ou t = 0.229 for the present 
growing network. Compared with the small values of 
Gmutuai and G m , such a large value of G ou t is surprising. 
It suggests the average interaction between the nodes 
which are in the downstream group of a given node is 
considerably strong. How to understand this kind of 
asymmetry, namely 1 ~ G ou t S> Gj n ? We suggest the fol- 
lowing possibility: In the network, there are some nodes 
which are so popular that a large population of the whole 
nodes will have an edge pointing to them (see Fig. EK)- 
Consequently, these nodes will have great possibility to 
belong to the downstream group of any particular node, 
and they will also have great possibility to be pointed 
to by other members of this group, making G ou t to be 
proportional to unity. However, the number of nodes de- 
cays quickly when the out-degree increases to about 250 
(see Fig. 12)3). Therefore, in the network there is no node 
which are so "generous" that it points to a large popula- 
tion of the whole network. This may make the value of 
Gin small. In other words, it might be the existence of an 
steep cut-off in the out-degree profile that accounts for 
the difference in the clustering coefficients G ou t and G m . 

D. Triangle coefficient Ctriangie 

For a particular node a, suppose node /3 £ Gd O wn(c>0- 
Then i&(a, (3) = |Gdown(a) f] Gdown(/?)| is the total num- 
ber of nodes that are pointed to by both a and (3. We 
define 

E E Ma,/3)/fcout(/3) 

r _ <* /3SG down ( a ) 

C-'trianglc — 77 ■ \ lv ) 
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The triangle coefficient Ctriangic signifies to what extent, 
if there is a directed edge from node a to node (3 and 
there is a directed edge from (3 to node 7, there will also 
be a directed node from a to 7. Calculation revealed that 

Ctriangic ~ 0.011. 

VI. CONCLUSION AND DISCUSSION 

In this work we have used a revised Krapivsky- 
Rodgers-Redner model to investigate the degree distri- 
bution of growing random network and to investigate 
whether such a kind of growing network could be re- 
garded as a small-world network. After excluding the 
possibility of self-connection (Fig.^^) and requiring that 
there is at most one directed edge from one node to 
any another node (Fig. ^3), the Monte Carlo simulation 
demonstrated that scale-free network with degree distri- 
bution coefficient v less than 2 can be generated. And 
it is also revealed that the average interactions between 
the nodes which are belong to the downstream group of 
a particular node is very strong, suggesting the growing 
network at the same time forms a "small world". The 
strong interaction in the downstream group of a partic- 
ular node was suggested to be closely related to the ex- 
istence of several "popular" nodes which are pointed to 



by a large fraction of the total population in the node 
system. Previous efforts often predicted that the scaling 
exponent v should be larger than 2, and there is still not 
many efforts to understand why many scale- free networks 
are at the same time small- world networks. It is hoped 
that the present work will help to improve our under- 
standing of the occur of scale-free networks with v < 2 
and to improve our understanding of the close relation- 
ship between scale-free and small- world networks. 

The present work suggests that, by excluding the possi- 
bility of self-connection and multiconnection, those many 
edges, which were associated with several nodes of ex- 
tremely large in- or out-degrees in the original Krapivsky- 
Rodgers-Redner network, are now redistributed to nodes 
of small or intermediate degrees. This may explain why 
a dramatic decrease in scaling exponents could be ob- 
served. 
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